SemText: a semantically enriched information retrieval system for biology

نویسندگان

Sophia Ananiadou

Philip Cotter

Chikashi Nobata

Naoaki Okazaki

Brian Rea

Yutaka Sasaki

Yoshimasa Tsuruoka

Jun’ichi Tsujii

چکیده

SemText draws upon a number of core technologies from the NaCTeM text mining tool kit to enhance automated detection and mark-up of biologically important terms appearing in text, such as gene/protein names. One of these tools is AcroMine which disambiguates acronyms based upon the context in which they appear. This functionality plays a key role in searching large document collections by allowing users to expand their queries and to include synonymous acronyms without losing the specificity of the original query.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semiautomatic Image Retrieval Using the High Level Semantic Labels

Content-based image retrieval and text-based image retrieval are two fundamental approaches in the field of image retrieval. The challenges related to each of these approaches, guide the researchers to use combining approaches and semi-automatic retrieval using the user interaction in the retrieval cycle. Hence, in this paper, an image retrieval system is introduced that provided two kind of qu...

متن کامل

Personalized Faceted Navigation in Semantically Enriched Information Spaces

Existing information retrieval systems provide users with limited support for efficient navigation in large semantically enriched information spaces. Several possible solutions were proposed, such as using faceted metadata search or semantic clusters of search results. We explore the possibilities of using enhanced faceted navigation with support for personalization, collaboration and Semantic ...

متن کامل

Semantic Search in Documents Enriched by LOD-based Annotations

This paper deals with information retrieval on semantically enriched web-scale document collections. It particularly focuses on web-crawled content in which mentions of entities appearing in Freebase, DBpedia and other Linked Open Data resources have been identified. A special attention is paid to indexing structures and advanced query mechanisms that have been employed into a new semantic retr...

متن کامل

Improved Skips for Faster Postings List Intersection

Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...

متن کامل